AD-A214  286 




Gaze  Controls 

with  Interactions  and  Delays 


Christopher  M.  Brown 

Technical  Report  278 
March  1989 


DTIC 


UNIVERSITY  OF 

ROCHESTER 


COMPUTER  SCIENCE 


Approved for public release;
Distribution Unlimited


89  H  08  041 


SECURITY CLASSIFICATION OF THIS PAGE

REPORT DOCUMENTATION PAGE

1. REPORT NUMBER: 278
2. GOVT ACCESSION NO.:
3. RECIPIENT'S CATALOG NUMBER:
4. TITLE (and Subtitle): Gaze Controls with Interactions and Delays
5. TYPE OF REPORT & PERIOD COVERED: Technical Report
6. PERFORMING ORG. REPORT NUMBER:
7. AUTHOR(s): Christopher M. Brown
8. CONTRACT OR GRANT NUMBER(s): DACA76-85-C-0031
9. PERFORMING ORGANIZATION NAME AND ADDRESS: Computer Science Department, 734 Computer Studies Bldg.,
   University of Rochester, Rochester, NY 14627
10. PROGRAM ELEMENT, PROJECT, TASK AREA & WORK UNIT NUMBERS:
11. CONTROLLING OFFICE NAME AND ADDRESS: D. Adv. Res. Proj. Agency, 1400 Wilson Blvd., Arlington, VA 22209
12. REPORT DATE: March 1989
13. NUMBER OF PAGES: 19
14. MONITORING AGENCY NAME & ADDRESS (if different from Controlling Office): US Army ETL,
    Fort Belvoir, VA 22060
15. SECURITY CLASS. (of this report): Unclassified
15a. DECLASSIFICATION/DOWNGRADING SCHEDULE:
16. DISTRIBUTION STATEMENT (of this Report): Distribution of this document is unlimited.
17. DISTRIBUTION STATEMENT (of the abstract entered in Block 20, if different from Report):
18. SUPPLEMENTARY NOTES: None.
19. KEY WORDS (Continue on reverse side if necessary and identify by block number):
    predictive control, active vision, gaze control

20. ABSTRACT (Continue on reverse side if necessary and identify by block number)

Five control systems loosely corresponding to primate saccadic, vergence,
pursuit, vestibulo-ocular, and head control operate on a simulated two-eyed
robot head maneuvered by a robot arm. The goal is to get some qualitative
understanding of the interaction of such reflexes under various assumptions.
The simulation is meant to be relevant to U. Rochester's robot. Thus it
incorporates kinematics of the robot head but assumes a "tool-coordinate"
system available to robot arm commands, so that arm kinematic calculations are
unnecessary. Dynamics are not modeled, since they are handled by the commercial
controllers currently used in the Rochester robot. Even small delays render
the effect of delay-free controllers unstable, but a multi-delay version of a Smith
predictor can cope with delays. If each controller acts on the predicted system
and ignores other controllers, the situation is improved but still potentially
unstable if controllers with different delays act on the same control output. The
system's performance is much improved if controllers consider the effect of other
controllers, and the resulting system is stable in the presence of a certain amount
of stochastic disturbance of control delays and inputs, and also in the presence
of systematic error arising from inaccurate plant and world models.



GAZE  CONTROLS  WITH  INTERACTIONS  AND  DELAYS 


Christopher Brown

Computer Science Department
University of Rochester
Rochester, NY 14627


ABSTRACT 

Five control systems loosely corresponding to primate saccadic, vergence, pursuit, vestibulo-ocular, and head
control operate on a simulated two-eyed robot head maneuvered by a robot arm. The goal is to get some
qualitative understanding of the interaction of such reflexes under various assumptions. The simulation
is meant to be relevant to U. Rochester's robot. Thus it incorporates kinematics of the robot head but
assumes a "tool-coordinate" system available to robot arm commands, so that arm kinematic calculations
are unnecessary. Dynamics are not modeled, since they are handled by the commercial controllers currently
used in the Rochester robot. Even small delays render the effect of delay-free controllers unstable, but a
multi-delay version of a Smith predictor can cope with delays. If each controller acts on the predicted
system and ignores other controllers, the situation is improved but still potentially unstable if controllers with
different delays act on the same control output. The system's performance is much improved if controllers
consider the effect of other controllers, and the resulting system is stable in the presence of a certain amount
of stochastic disturbance of control delays and inputs, and also in the presence of systematic error arising
from inaccurate plant and world models.


INTRODUCTION 


Behaving, actively intelligent (mechanical or biological) systems must manage their computational and
physical resources in appropriate ways in order to survive and to accomplish tasks. At Rochester we are building
an integrated actively intelligent system that incorporates abstract reasoning (planning), sensing, and acting
[Bro88]. The active intelligence paradigm we shall exploit incorporates the following ideas.

1. A hierarchy of control, so that the highest cognitive levels can reason in terms of what they want done
rather than how to do it in detail. This hierarchy should extend throughout the system.

2. At the lower levels, the control hierarchy ends with visual and motor skills or reflexes. These capabilities
are cooperative but to some extent independently controlled. Some are always running, and they
form the building blocks on which more complex behavior is built. Examples are tracking targets to
minimize motion blur or redirecting gaze as a result of attentional shifts.

3. Part of the job of low-level visual capabilities is to present perceptual data, such as flow fields or
depth maps, to higher-level visual processes. Low-level processes can often benefit from knowledge of
self-initiated motion on the part of the sensing entity. They can often be built on the low-level control
capabilities.

We currently have a nine degree of freedom robot body-head combination controlled by a Sun computer
interfaced over a serial line to a VAL-II robot control system, and over a VME bus to the three eye motor
controllers. The visual input is processed by a pipelined image processing system. The system has been
used in several promising demonstrations of considerable complexity in depth-map creation and vergence
([BO88,OP89]). It has also been used for some simple but effective real-time applications in tracking and
fixation.

What has been missing so far has been the cooperation of several modes of control, or the operation of
several at once. In the work reported below, a simulation of the robot head and eyes is used to examine the
effects of different styles of interaction between certain control capabilities that we have implemented (such
as tracking) or anticipate using (such as using eye movements to compensate for head movements).

The simulation software is based on the actual robot head kinematics, and has provided a flexible tool for
investigating the interaction of different control methods and different types of control interaction.


THE  MODEL  OF  HEAD  AND  IMAGING 

The simulator geometry can capture all the essentials of the Rochester robot [Bro88,BR88] (including the
annoying "non-spherical" geometry of the camera pans and tilts). It allows geometric parameters to be
changed to explore the effects on error and the possibility of adaptive control. The robot arm is not
modeled: rather the model abstracts it to a single eye-support platform that can be positioned arbitrarily in
space with six degrees of freedom: three in position, three in orientation. On the model head is a modelled
tilt capability that affects both cameras, and each camera has a modelled pan capability. The geometry
of the offsets of the various axes in these links is variable, and incorporates the geometrical complexity of
the real system. The simulated mechanism is massless; this reflects the effective behavior of our current
hardware system when viewed from its high-level control operations. The independent control of the camera
pans allows us to model modern theories of saccadic and vergence systems; heads with mechanical vergence
capability need one fewer motor but must use older models of these systems.

The camera models incorporate point projection with fixed focal length, as well as a "foveal-peripheral"
distinction by which the location of imaged points is less certain, outside a small foveal region, depending
on the off-axis angle of the target being imaged. The target itself is a single point in 3-D space, moving
under dynamical laws. The experiments below were often carried out with the target point in orbit about an
invisible "black hole"; thus the target followed an elliptical path. In other experiments the target moved in
a straight line. In some of the experiments involving delays the target was stationary but the robot moved
in X, Y, and Z, thus creating a perceived target motion, but one due to factors under robot control.

It is assumed that the imaging system knows the distance to the target (in real life, this distance may be
derived from binocular stereo, a priori knowledge, any of a number of monocular distance cues, kinetic depth
calculations, etc.). It is assumed that, for each eye, the instantaneous retinal velocity of the target is known
(i.e. the vector difference between its position in the current image and its position in the last image). Other
than that, the system only knows the left and right image (x,y) location of the target's image. Of course the
target's image position and hence image velocity is perturbed by uncertainties arising from the blurriness
of peripheral vision, should the target not be foveated. There is a further provision to add uniform noise to
the target's imaged position; this can model quantization noise, or be used to approximate process noise in
the target's motion.
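The imaging assumptions above can be illustrated with a minimal Python sketch. It is not the simulator's code: the function names, the camera-frame convention (z along the optic axis) and the `half_width` noise parameter are assumptions made for illustration, and the peripheral-blur law is deliberately left out.

```python
import random


def project(point, focal_length=1.0):
    """Point projection of a 3-D point given in camera coordinates.

    Assumed convention: z lies along the optic axis, so a foveated
    target projects to (0, 0).
    """
    x, y, z = point
    if z <= 0:
        raise ValueError("target must be in front of the camera")
    return (focal_length * x / z, focal_length * y / z)


def retinal_velocity(current, previous):
    """Vector difference between the current and the last image position."""
    return (current[0] - previous[0], current[1] - previous[1])


def add_uniform_noise(image_point, half_width, rng):
    """Perturb an image position with uniform noise in [-half_width, half_width]."""
    return (image_point[0] + rng.uniform(-half_width, half_width),
            image_point[1] + rng.uniform(-half_width, half_width))


# Usage: a target drifting sideways at constant depth.
previous = project((0.5, 2.0, 4.0), focal_length=2.0)
current = project((1.0, 2.0, 4.0), focal_length=2.0)
velocity = retinal_velocity(current, previous)
noisy = add_uniform_noise(current, 0.1, random.Random(0))
```

The velocity here is expressed per tick, since it is a plain difference of two successive image positions.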


THE  MODEL  OF  CONTROL 


ZERO  DELAY  CONTROL 

The input to the control systems is usually based on quantities that can be inferred from vision (e.g. the (x,y)
position of the target, which should be driven to (0,0), or target disparity between the two eyes, which should
be driven to 0). Some control inputs arise from the robot's "proprioception" (e.g. the amount the cameras
are panned or tilted from their null position), and some are from other control signals (when one control is to
null out the effects of another). The simulation has controllable output parameters corresponding to one set
of VAL-II robot control parameters (the VAL-II "tool coordinate system") for the head: its X,Y,Z position
and A,B,C orientation. Also there is direct control over the pans (independent for left and right) and tilt
(common) of the two cameras. In every case the outputs of controls are velocity commands to the nine
degrees of freedom in the system, reflecting one simple form of our current interface to the motor controllers.

The basic control loops that manage the system are loosely inspired by the primate visual system. However,
most assumptions and technical decisions have been made either for the sake of simplicity or to mimic our
robot rather than for the sake of faithfully modelling known biological systems or optimal mechanical systems
(see the Discussion section below). Still, one of the major design goals is that the system can support more
detailed control models. Most of the loops have several parameters, such as the proportional, integral, and
derivative (PID) constants of their controllers, and their delays and latencies. Delay means the amount of
time after a commanded motion before it commences; this is often called latency in the literature. Latency,
as used here, is how long it takes the command to complete: it is another time constant that indicates both
how soon another command can be accepted and how long the command will be affecting the controlled (velocity)
variables. In all the work so far, only saccades have latency greater than unity. In the robot system the
delay corresponds to how long it takes the mechanical system to respond to a motion ordered from a high
software level, and the latency reflects how long it takes to complete a command. The assumption is of
control delay, not sensor delay: that is, we assume that "sensors" (visual or robot- and eye-control motor
states read from their controllers) are available to the system immediately, without delay, and thus reflect
the true state of the world. (Our analysis and the algorithms extend to the case that the sum of control and
sensor delays is constant for any controller.)

There are five separate control systems:

1. Saccade: fast slewing of cameras to point in commanded direction. Saccades are modelled as open
loop, though in primates there are "secondary" saccades that correct errors in initial saccades. The
saccadic system tries to foveate the target and to match eye rotations to the target velocity so as to be
tracking the target as soon as the saccade is completed. Current opinion is that the saccadic system
is aware of the 3-D location of the target, not just the location of its retinal image. However, in the
implementation used for the experiments below, saccades operate with retinal locations and velocities,
not 3-D locations or distance. The left eye is dominant in the system. The saccade aims to center
the target image on the fovea of the left eye; the right eye is panned by the same amount (and of
course tilted by the same amount for mechanical reasons). Thus the saccade maintains the current
vergence angle. It is implemented as a constant-speed slewing of all three pan and tilt axes, with one
of them attaining a system constant maximum velocity. The slewing continues until the target should
be foveated (it may not be, due to peripheral blurring or other noise), at which time the system is left
with eye velocities that match the perceived target motion before the saccade. The saccadic system is
characterized by its maximum velocity and its delay.

2. Smooth Pursuit: tracking a moving target. This is a "continuous" activity as opposed to the
discontinuous saccadic control activity. The error here is target position in the left eye (which should be
(0,0)), and the commands are pan and tilt velocities to the left eye. The pursuit system has delay,
latency, and PID control. In both the saccadic and smooth pursuit systems modeled here, there is
strict (exclusive) left-eye dominance.

3. Vergence: the vergence system measures horizontal disparity between the target position in the left
and right eyes, and pans the right eye to reduce it. The vergence system has delay, latency, and PID
control.

4. Vestibulo-Ocular System: the VOR system is open loop in the sense that its inputs come from the
head positioning system and its outputs go to the eye positioning system. Its purpose is to stabilize
eyes against head motion, and its inputs are the control signals for head position (XYZ velocities, ABC
angular velocities). It also uses the distance of the target, since that affects the appropriate response.
The VOR should ideally be implemented by inverse kinematics, to which the current implementation
(and presumably the neural one) is an approximation. Its output is commands to the pan and tilt
controls to null out the apparent target motion caused by head motion. It is characterized by delay,
latency, and open loop proportional gain.

5. Platform Compensation: This system is a head-control, not gaze-control system. These systems are
known to interact in subtle and complex ways, but this particular reflex simply attempts to keep the
eyes "centered in the head", so that the camera pans or tilts are kept within "comfortable" mechanical
ranges. The "comfort function" is a nonlinear one, $x/((x - x_{max})^2)$, where x is the average pan angle
(to control head "yaw" movements) or the tilt angle (to control head "pitch" movements). In either
case $x_{max}$ is the mechanically imposed limit of the system. This reflex is open loop (eye position
affects head position), with delay, latency, and open loop proportional gain.
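The PID control named for the pursuit and vergence loops above can be sketched in a few lines of Python. This is a generic discrete PID with a one-tick time step, offered as an illustration only: the class name, the gain names `kp`, `ki`, `kd`, and the sign convention (a positive image error calls for a negative velocity command) are assumptions, not details taken from the report.

```python
class PID:
    """Discrete PID controller producing a velocity command per tick."""

    def __init__(self, kp, ki=0.0, kd=0.0):
        self.kp, self.ki, self.kd = kp, ki, kd
        self.integral = 0.0
        self.prev_error = None

    def step(self, error):
        """Return the velocity command for the current error."""
        self.integral += error
        # No derivative term on the first tick, when there is no history.
        derivative = 0.0 if self.prev_error is None else error - self.prev_error
        self.prev_error = error
        return -(self.kp * error + self.ki * self.integral + self.kd * derivative)


# Usage: a delay-free toy loop in which the command moves the error directly.
pid = PID(kp=0.5)
error = 4.0
trace = []
for _ in range(3):
    error += pid.step(error)
    trace.append(error)
# trace is [2.0, 1.0, 0.5]: the error halves on every tick
```

With a purely proportional gain of 0.5 and no delay, the error decays geometrically; the sections on delay below describe what changes once the command takes effect late.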

The system has the capability of operating in two modes: smooth pursuit and saccade. In smooth pursuit
mode, the VOR, platform compensation, pursuit, and vergence systems are left running. In saccade mode,
other controls may be disabled. This allows modelling the effects of turning off vergence, head compensation,
tracking, etc. during saccades. Ultimately it seemed best only to turn off tracking during saccades, but
other combinations are demonstrated below.

The delays and latencies are implemented with a command pipeline, in which the commanded changes in
velocities are entered opposite the time in the future they are to take effect. Time is discretized to some level,
called a tick henceforth. A larger delay results in entry of the corresponding command further in the future.
Latencies are implemented by dividing the commanded change between as many discrete time periods as
necessary to spread the effect over the latency. The pipeline thus is indexed by (future) time instant, and it
has entries that hold the commanded velocities for the six head degrees of freedom and three camera degrees
of freedom. Each instant also has an entry corresponding to its mode (saccadic or pursuit). The pipeline is
implemented as a ring buffer.
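The command pipeline described above might look like the following Python sketch. It is one possible reading, not the simulator's code: the class and method names are hypothetical, each slot is assumed to hold velocity changes (rather than absolute velocities), and changes aimed at the same slot are assumed to add.

```python
class CommandPipeline:
    """Ring buffer of future velocity changes, one slot per tick."""

    N_DOF = 9  # six head DOF (X, Y, Z, A, B, C) plus left pan, right pan, tilt

    def __init__(self, horizon):
        self.horizon = horizon
        self.changes = [[0.0] * self.N_DOF for _ in range(horizon)]
        self.modes = ["pursuit"] * horizon
        self.now = 0  # index of the slot for the current tick

    def schedule(self, dof, change, delay, latency=1):
        """Enter a velocity change `delay` ticks ahead, spread over `latency` ticks."""
        if latency < 1 or delay < 0 or delay + latency > self.horizon:
            raise ValueError("command does not fit in the pipeline")
        share = change / latency
        for k in range(latency):
            slot = (self.now + delay + k) % self.horizon
            self.changes[slot][dof] += share  # commands to the same slot add

    def tick(self, velocities):
        """Apply the current slot to `velocities` in place and advance one tick."""
        slot = self.now
        for dof in range(self.N_DOF):
            velocities[dof] += self.changes[slot][dof]
            self.changes[slot][dof] = 0.0  # recycle the slot for future use
        mode = self.modes[slot]
        self.modes[slot] = "pursuit"
        self.now = (self.now + 1) % self.horizon
        return mode


# Usage: two commands to the same axis (index 6, assumed to be left pan).
pipe = CommandPipeline(horizon=8)
velocities = [0.0] * CommandPipeline.N_DOF
pipe.schedule(dof=6, change=1.0, delay=2, latency=2)  # spread over two ticks
pipe.schedule(dof=6, change=0.5, delay=2)             # lands in one tick
for _ in range(4):
    pipe.tick(velocities)
```

After the four ticks in the usage example, the full change of 1.5 has been applied to axis 6; nothing reaches it during the first two ticks because of the delay.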

For the delay-free case, the control architecture is strictly independent. That is, controllers are ignorant
of each other's effects, and the combination of control effects is modeled by all controllers incrementing
or decrementing a common control register (indicating some motor velocity setting). All increments and
decrements are made to the current value that is there already, which perhaps is nonzero because of input
from another reflex. Thus the control commands are summed in the simplest possible way, as if each control
system's output were a D.C. voltage and all the outputs were soldered together at the effector motor's input.

The saccadic system shuts down the pursuit system in the sense that for the duration of the saccade (which
is computed from the image distance it must move the fovea and the maximum velocity it can move), all
other commands in the pipeline are overwritten, and the mode is changed to "saccade". Further commands
trying to affect these instants may be ignored, depending on the (compile-time) policy desired.
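The saccade duration computation mentioned above, together with the constant-speed slewing rule from the list of control systems, can be sketched as follows. This is an illustrative guess at the arithmetic, not the simulator's code: `plan_saccade` and its arguments are hypothetical names, the errors are taken as angular distances per axis, and the duration is left as a real number rather than rounded to ticks.

```python
def plan_saccade(pan_error, tilt_error, v_max):
    """Return (duration, pan_velocity, tilt_velocity) for a constant-speed slew.

    The axis with the larger distance to cover moves at v_max; the other
    axis is slowed so that both finish at the same time.
    """
    largest = max(abs(pan_error), abs(tilt_error))
    if largest == 0.0:
        return 0.0, 0.0, 0.0  # already foveated, nothing to do
    duration = largest / v_max
    return duration, pan_error / duration, tilt_error / duration


# Usage: pan must cover 6 units, tilt -3 units, maximum speed 2 units per tick.
duration, pan_velocity, tilt_velocity = plan_saccade(6.0, -3.0, v_max=2.0)
# duration is 3.0, pan_velocity is 2.0, tilt_velocity is -1.0
```

In the report's scheme the right eye would receive the same pan command as the left, which preserves the vergence angle; the sketch shows a single pan axis for brevity.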

NON-ZERO  DELAY  CONTROL 

Slight amounts of delay destabilized the simulated system, as expected (see the Experiments section below).
Control with delays can be stabilized by turning down gains and slowing the response of the system, but its
performance then suffers. Successful control with delays incorporates some form of prediction [Mar79]. The
controller implemented in the simulation is a version of a Smith predictor [Smi57,Smi58], which is the basic
idea behind most modern methods.

Smith's Principle is that the desired output from a controlled system with delay p is the same as that desired
from the delay-free system, only delayed by the delay p. Let the delay be $z^{-p}$, the delay-free series controller
be $C(z)$, the desired delay controller be $\bar{C}(z)$, and the plant be $A(z)$. The delay-free system transfer function
will be

$$\frac{CA}{1 + CA}$$

The delay system with its desired controller has transfer function

$$\frac{\bar{C} A z^{-p}}{1 + \bar{C} A z^{-p}}$$

But Smith's Principle is

$$\frac{\bar{C} A z^{-p}}{1 + \bar{C} A z^{-p}} = \frac{C A z^{-p}}{1 + CA}$$

This quickly leads to the specification for the controller $\bar{C}$ in terms of $C$, $A$, and $z^{-p}$:

$$\bar{C} = \frac{C}{1 + CA\,(1 - z^{-p})}$$
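The algebra between Smith's Principle and the controller specification can be written out in three lines. In the block below $C$ is the delay-free controller, $\bar{C}$ the desired delay controller, $A$ the plant and $z^{-p}$ the delay; $G$ and $H$ are shorthand introduced only for this derivation and do not appear in the report.

```latex
% G and H are shorthand introduced for this derivation only.
\begin{align*}
G &= \bar{C} A z^{-p}, \qquad H = \frac{C A z^{-p}}{1 + CA} \\
\frac{G}{1 + G} = H \;&\Longrightarrow\; G = \frac{H}{1 - H}
   = \frac{C A z^{-p}}{1 + CA - C A z^{-p}} \\
\bar{C} = \frac{G}{A z^{-p}} &= \frac{C}{1 + CA\,(1 - z^{-p})}
\end{align*}
```

As a check, setting $p = 0$ makes the bracket vanish and gives $\bar{C} = C$, so the delay controller reduces to the delay-free one when there is no delay.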

This simple principle has spawned a number of related controllers, often arising from each other by simple
block-diagram manipulation. Figure 1 is one block diagram of a Smith prediction controller, and it describes
the implemented system in the simulator.

If the maximum delay of a controller in the system is T, the plant model is a pipeline of enough future
robot states to reach time T into the future, updated and extended once a tick. Ideally the robot's state is
predictable, since only the control commands act on it. Practically there may be some plant noise. In the
work so far, the world prediction is simplified by assuming the world is static and that the robot does all the
moving (navigation in a static environment). As part of the experiments, target motion was added to test
the system's response to a false target model.
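A plant model of the kind described above can be sketched for a single degree of freedom. This is a simplified illustration under stated assumptions: the state is one scalar position, the pending commands are velocity changes indexed by how many ticks ahead they take effect, and there is no plant noise. The function name `predict_states` is hypothetical.

```python
def predict_states(position, velocity, scheduled_changes):
    """Predict positions for each future tick covered by the command pipeline.

    scheduled_changes[k] is the velocity change that takes effect k ticks
    from now; the returned list has one predicted position per entry.
    """
    states = []
    for change in scheduled_changes:
        velocity += change       # the command takes effect at this tick
        position += velocity     # one tick of motion at the updated velocity
        states.append(position)
    return states


# Usage: currently moving at 1.0 per tick, with two changes already scheduled.
future = predict_states(0.0, 1.0, [0.0, 0.5, -1.5])
# future is [1.0, 2.5, 2.5]: the last command brings the axis to rest
```

Extending the pipeline once a tick then amounts to dropping the first predicted state and appending one more, using whatever command has just been scheduled for the newest slot.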


EXPERIMENTS 

DELAY-FREE  CONTROL 

In all the simulations, the goal of the system is to put one or both of its eyes squarely on the target (at
retinal position (0,0)) and keep them there. The head is always in an upright position, so pans rotate the
cameras about a vertical world axis, tilts rotate the cameras about a horizontal axis. With a static head,
pans induce image x motion upon a static, foveated target and tilts induce image y motion. In all the graphs
of this section, the horizontal axis is time, and the vertical axis is pan and tilt error, or equivalently the
image x and y position of the target. Each graph shows both left and right eye x and y errors, but often
the y errors are superimposed since the tilt platform is common to both cameras. In every case there is
"peripheral blur", which is modelled by adding, outside a small "fovea", uniform noise to the target (x,y)
location, with standard deviation proportional to 1/d, where d is the euclidean distance of (x,y) from the
(0,0) point. The simulation does not use realistic time-constants and speeds, which instead are scaled so
that interesting effects happen within a few ticks.

Figs. 2 and 3 illustrate the cumulative effect of simply superimposing control capabilities: each operates
independently and their outputs are simply summed at the effectors. Delays are zero, latencies (except for
saccades) unity. In these two figures tracking is by position error signal.




Figure 1: The implemented Smith predictor control. The block diagram is easily derived from the Smith predictor
equation, with the MODEL PLANT, MODEL WORLD, and MODEL SENSOR blocks corresponding to A. The delay
controller is represented by the block labelled CONTROL and everything below the dashed line. The CONTROL block
represents all five control systems, and the DELAY block represents a vector of their five independent delays. The
PLANT, WORLD, and SENSOR blocks represent the robot simulation. Delayed control is implemented with a pipeline
of controls to take place in the future, and the plant model is a similar pipeline of predicted robot states derived from
the control.




Figure 2: Increasingly effective delay-free control results from superposition of noninteracting controllers. (a) Tracking
only: the left (dominant) eye pans and tilts, inducing tilt in the right eye. The tracker uses a position error
signal. The right eye gets no pan signal, and its horizontal error accrues from target motion. The left eye tracks
successfully until it hits mechanical stop at tick 14. (b) Add vergence: both eyes hit stops at about tick 15. (c)
Add head compensation: this control is to keep eyes from hitting mechanical stops by turning the head in the same
direction as the tracking motion. A less-desirable effect is to amplify the tracking signal, overcompensating and
destabilizing the tracking. (d) Add VOR, which effectively compensates the head rotation with eye rotations.




Figure 3: (a) Continuing the previous figure with tracking driven by position error, add saccades, in which setting
VOR and head compensation are turned off during saccade. The saccade drives the left eye error more or less to
zero: it is affected by the peripheral blurring effect, which makes the initial location of the target [...]. When VOR,
head compensation and vergence are turned on after the saccade, the first two reflexes have a transient effect.
(b) Here let vergence run during the saccade but inhibit VOR and head compensation until after saccade completes.

Fig. 4 shows the effects of tracking with a velocity error signal. Here saccades are initiated if the target falls
outside a fixed distance (here 1) from the fovea.

Finally, Fig. 5 shows the effects of control delay on the system. The smallest delays, applied uniformly or
to just one control, destabilize the system seriously.

DELAY CONTROLS

As derived, the Smith predictor is appropriate for a single system control (or sensing) delay. In our system
there will be differing delays reflecting different software actions (serial line plus VAL-II software versus
VME bus connection to the eye motor controllers, for instance). The idea of the Smith predictor is easily
extended, however.

Independent  Delay  Control 

Two types of control were implemented using the Smith controller of Fig. 1. In the first, the controllers
are ignorant of the delays of other controllers, and also ignorant of the sharing of output variables between
controllers. Each controller knows its own delay T, and uses the following algorithm: look ahead time T and
retrieve the predicted robot and control states for that time. Apply the control appropriate for these future
states now.
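The look-ahead rule above can be tried on a toy problem. The Python sketch below is not the robot simulation: it uses a single scalar image error that each command shifts directly once it takes effect, a proportional controller, and a hypothetical `simulate` function. It contrasts a delay-free controller applied naively to a delayed system with the same controller applied to the state predicted one delay ahead.

```python
def simulate(delay, gain, ticks, predict):
    """Run a one-dimensional delayed control loop and return the error history.

    Each command shifts the error by its value, `delay` ticks after it is issued.
    With predict=True the controller acts on the predicted future error.
    """
    error = 1.0
    pending = [0.0] * delay      # issued but not yet effective, oldest first
    history = [error]
    for _ in range(ticks):
        if predict:
            # Plant model: the error once every pending command has landed.
            basis = error + sum(pending)
        else:
            basis = error        # delay-free controller, ignorant of the delay
        pending.append(-gain * basis)
        error += pending.pop(0)  # the command issued `delay` ticks ago lands
        history.append(error)
    return history


# Usage: a two-tick delay with a proportional gain of 0.8.
naive = simulate(delay=2, gain=0.8, ticks=100, predict=False)
smith = simulate(delay=2, gain=0.8, ticks=100, predict=True)
```

In this toy setting the naive loop oscillates with growing amplitude while the predictive loop settles; the example only illustrates the mechanism and says nothing about the gains or delays used in the report's experiments.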

Fig. 6 shows some sample effects of this independent delay-control strategy. The system is stable for certain
combinations of delays, but is unstable unless all the non-vergence delays are the same.

Interacting  Delay  Control  and  Noise 

The  independent  delay  control  algorithm  is  not  as  smart  as  it  could  be.  The  short-delav  controls  do  not  look 
into  the  future  as  far  as  the  long-delay  controls,  and  therefore  they  do  not  anticipate  the  effects  of  slower 
controls  This  effect  shows  up  when  long-delay  and  short-delay  controls  affect  each  otic  r’s  output,  either 
directly  or  through  the  kinematic  chain  The  reason  (lie  verge  reflex  can  run  with  different  delay  and  not 
destabilize  die  independent  delay  control  system  is  that  no  other  control  (barring  saccade)  affects  the  right 
camera  s  pan  velocity,  and  panning  is  at  the  etui  of  the  kinematic  chain  Assume  each  controller  knows  its 


8 


(!• ' 


I  t  l:  • :  i  ■  1  i  .1  N'lu'ii'*'  vr|  i  it\  «  r  I  f  ■  i  ttakinc  with  saccades  for  position  centr'd  tracking  is  stilt'  I  to 

st  "a  :  \  stat'  posit  i  tp  <t  r  r  i  1  ■  t  Add  vt.  t  c'-ik  .  and  also  change  It '‘ad  kinematics  ( unknown  to  an  \  rent  toll*  jc  t  front  a 
'  sj  If  l,'  .It’  p>  ■ur':\  tollo  Ho  |.  sj.  ,-  [(,)>,  ,j  s  ropftciir  a  It  >n  of  pa  n,  til: .  and  opt  i'  a.\>  s  J  }| .  chani;‘  d  C'ottt' i  r  \  It  a- 
Inti'  <!ii  > 


in')  ( b ) 


Figure  5'  (a)  The  no-dolax  cont roller  applied  to  the  system  with  a  constant  delay  of  one  tick  in  all  controls  Idealh 
tins  graph  should  be  a  dclaxed  version  of  Fig  2(d).  ( >.)  The  no-delay  controller  applied  with  zero  delay  in  all  controls 
except  tracking,  which  has  a  delay  of  one  tick 


9 


*  nr 


Figure 6: (a) To be compared with Fig. 2(d) and Fig. 5(a). The Smith predictor with independent control is
stable with uniform controller delays. (b) Independent control is also stable when the vergence control delay differs. (c)
Saccades induce transients, but the system is still stable even if the vergence delay differs. (d) The system is unstable if a
non-vergence control, here VOR, has a different delay from the other non-vergence controls.




own delay T_i and the delays T_j of all the other controllers in the set {S} that share an output with it. Then
each controller can use the following (interacting controls) algorithm: Look ahead the maximum delay T_max
of any controller in {S} and retrieve the predicted robot and control states for that time. Apply the control
appropriate for these future states at (possibly future) time T_max - T_i. This algorithm successfully copes with a
different delay for each control (Fig. 7(a)).
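
The timing rule above can be sketched in a few lines. This is only an illustration under stated assumptions: delays are whole ticks, and the names `Controller` and `interacting_schedule` and the example delays are hypothetical, not taken from the simulator.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Controller:
    name: str
    delay: int  # ticks between issuing a command and its taking effect


def interacting_schedule(group):
    """For controllers that share an output, return {name: (lookahead, hold)}.

    lookahead: how many ticks ahead to read the predicted robot/control state.
    hold:      how many ticks to wait before issuing the command, so that
               every command in the group takes effect at now + lookahead.
    """
    t_max = max(c.delay for c in group)
    return {c.name: (t_max, t_max - c.delay) for c in group}


# Hypothetical group of controls sharing one output.
group = [Controller("track", 3), Controller("vor", 1), Controller("head", 2)]
schedule = interacting_schedule(group)
for name, (lookahead, hold) in schedule.items():
    print(f"{name}: predict state at t+{lookahead}, issue command at t+{hold}")
```

In this sketch the slowest control issues its command immediately, while faster ones wait, so all commands computed for the same predicted state arrive together.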

An easy implementation of this algorithm that loses some flexibility is simply to increase the delay of all
controls that share an output to be the maximum delay of any of their number and apply the independent
delay control algorithm. Then all controls in the set look ahead as far as their slowest member, and act at
the current moment. The resultant slowing of fast controls is of course suboptimal when they do not have
to act in concert with slow controls.
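
A minimal sketch of this delay-padding shortcut follows. It assumes, for simplicity, that each control drives exactly one named output; the function `pad_delays`, the control names, and the output names are all illustrative assumptions.

```python
from collections import defaultdict


def pad_delays(delay, output):
    """Raise each control's delay to the slowest delay on its output.

    delay:  {control_name: delay_in_ticks}
    output: {control_name: name_of_the_single_output_it_drives}
    """
    slowest = defaultdict(int)
    for name, ticks in delay.items():
        slowest[output[name]] = max(slowest[output[name]], ticks)
    return {name: slowest[output[name]] for name in delay}


# Hypothetical controls: two share the left pan axis, one drives the right.
delay = {"track": 3, "vor": 1, "verge": 2}
output = {"track": "left_pan", "vor": "left_pan", "verge": "right_pan"}

padded = pad_delays(delay, output)
added_latency = {name: padded[name] - delay[name] for name in delay}
print(padded)
print(added_latency)
```

The `added_latency` figures make the cost explicit: a fast control is slowed by the full difference between its own delay and that of its slowest partner, while a control that shares its output with no one keeps its own delay.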

Figures 7 and 8 show some experiments with interacting delay control, and introduce stochastic disturbances
in the inputs and delays. The system is robust against sensor noise, or varying uncertainty in target location.
The preliminary conclusion is that the system destabilizes with unpredictable delays when the outputs are
changing relatively fast, but (of course) is less susceptible to unpredictable delays if the control outputs are
only changing slowly.


DISCUSSION  AND  FUTURE  WORK 


SIMULATION  AND  REALITY 

The goals for the simulator were to provide a kinematic and imaging model fairly close to that of the
Rochester robot. The model has no dynamics, but neither does the robot from the point of view of the
applications programmer; the current robot and motor control software hides this level. The simulator does
seem adequate to illustrate the characteristics of different styles of control and to demonstrate the qualitative
behavior resulting from control interaction, delays, and various forms of uncertainty. As the sophistication
of the control technology at Rochester increases, a useful simulator would have to incorporate increasingly
sophisticated models.

Likewise the simulator's exterior world and image-processing model is simple, consisting of a single point
whose image is instantaneously and reliably (if noisily) found. To some extent this is also realistic, since it
reflects the capability of frame-rate feature detection [Bro88], but it ignores the existence of more sophisticated
operations or those with longer time-constants.

Simulation is likely to remain a basic tool in a real-time robotics laboratory, but as the control and visual
environment gets sophisticated the simulations become slow and costly. The advent of cheap real-time
hardware makes it increasingly practical to replace simulations with real-world experiments, which are more
likely to yield relevant results.

COMPARISON  WITH  PRIMATE  GAZE  CONTROL  MODELS 

Because of its experimental accessibility, the simplicity of the plant involved, and the diverse collateral
knowledge about the visual system, the gaze control system is the best-studied biological sensorimotor
control system. The animal model most relevant to our robotic work is the primate, because of the close
relationship of visual attention with fixation that arises with foveal (i.e. narrow-angle, high-resolution) vision.
Gaze control in the cat and rabbit (and frog) is significantly different.

Knowledge of the primate gaze-control system might help provide insight to robot designers, and if the right
hardware were available robotic equipment might be used to implement computational models of gaze control,
thus providing an experimental facility complementary to the usual psychophysical and neuroscientific ones.
The work described here is not yet dedicated to modeling biological systems, but nonetheless comparisons




Figure 7: (a) The interacting control algorithm dealing successfully with a mixed set of delays. Here the longest
non-vergence delay is three ticks, and the resultant behavior is that of a system whose non-vergence controls have a
uniform delay of that amount. (b) Sensor noise (uniformly distributed disturbance of the target (x,y) location in each
eye with σ = 0.0J in each dimension) does not affect stability, but causes excursions larger than its σ through the
interaction of tracking and verging. (c) Here with probability .1 a control signal is delivered one tick early, and with
probability it is delivered one tick late. The system is on the verge of instability. (d) With the same probabilities as
in (c), more disturbances happen to occur early in the sequence when outputs are changing rapidly, destabilizing the
system.




Figure 8: (a) Continuing from the previous figure, the previous sensor noise is added to the system along with the
previous stochastic delays: the system is stable. (b) Here there is no noise (other than peripheral blurring), but the
target model is wrong. The target is moving approximately perpendicular to the robot's motion instead of remaining
static. The error periodicity of 10 ticks is interesting. (c) Here the situation is as in (b), but the target is moving
faster, and toward the robot. As it gets close the controls cannot respond fast enough and the system destabilizes.


are inevitable, amusing, and possibly useful. This section is a very brief and admittedly selective sampling
from the immense and rich (i.e. confusing and contradictory) literature on gaze and head control in biological
systems. It seems fair to say that most of these systems interact, and that it is very difficult to lay down
hard and fast rules about what individual systems can and cannot achieve.

Pursuit and Opto-Kinetic Reflex

The Opto-Kinetic Reflex (OKR) causes the eyes to follow a motion of the full visual field, and is driven
(to first order) by "retinal slip", or optic flow. In primates the OKR comes in two stages, a faster (direct)
and a slower (indirect), with the direct being more dominant in man. The smooth pursuit mechanism is to
track small targets, and is often described as being driven by foveal retinal slip. Thus these two facilities
are similar, and there is some thought that the direct part of the OKR response is just the smooth pursuit
system [Col85].

The situation with smooth pursuit is anything but simple, however. It seems to be possible to pursue
extra-foveal targets smoothly. Smooth eye movements cannot normally be induced without a smoothly-moving
stimulus, but they persist after a target disappears, thus arguing that some form of prediction can excite
the response [Eck83]. Smooth pursuit gain drops with stimulus velocity. Last, smooth pursuit in monkeys
seems to be driven (in a large fraction of individuals) not just by velocity error but also by position and
acceleration errors. Thus a model such as Young's (see below) that suggests a reconstructed target velocity
is the control input (rather than a sensed optical flow) could be augmented with a broader range of error
signals [LMT85].

The simulator has implemented both velocity control and position control with predictable results (compare
Fig. 3(b) with Fig. 4(b)). Without position feedback, the system matches velocity and relies on saccades,
which take place when position error goes over a threshold, for position control. There seems no advantage
to this implementation unless optic flow velocity can be sensed directly, as opposed to position. For instance,
if motion blur could be directly sensed, it would make a direct optic-flow velocity signal. Of course analysis
of a particular motion-blur track could yield its centroid or endpoints, bringing us back to position control.
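
The division of labor between velocity matching and threshold-triggered saccades can be illustrated with a one-dimensional sketch. It assumes an ideal eye that exactly achieves the commanded velocity and a saccade that lands exactly on target; the function `track` and all numbers are illustrative, not the simulator's code or parameters.

```python
def track(target_pos, eye0, threshold, dt=1.0):
    """Velocity-matching pursuit with threshold-triggered saccades (1-D)."""
    eye = eye0
    eye_trace, saccade_ticks = [eye], []
    for k in range(1, len(target_pos)):
        target_vel = (target_pos[k] - target_pos[k - 1]) / dt
        eye += target_vel * dt              # pursuit: match velocity only
        if abs(target_pos[k] - eye) > threshold:
            eye = target_pos[k]             # saccade: remove position error
            saccade_ticks.append(k)
        eye_trace.append(eye)
    return eye_trace, saccade_ticks


target = [0.1 * k for k in range(10)]       # target drifting at constant rate

# With a tight threshold a single saccade removes the initial offset.
trace_a, saccades_a = track(target, eye0=-0.5, threshold=0.2)

# With a loose threshold no saccade fires and the offset persists.
trace_b, saccades_b = track(target, eye0=-0.5, threshold=1.0)
final_error_b = target[-1] - trace_b[-1]
print(saccades_a, saccades_b, final_error_b)
```

Velocity matching alone never reduces the initial position offset in this sketch, which is why the saccade (or explicit position feedback) is needed.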

Vergence  and  Saccades 

The primate vergence system is rather slow, and coupled to the focussing (accommodative) systems and the
saccadic system. Vergence and accommodation are coupled pairwise, and the "near triad" is a reflex made up
of these three systems, in which focus and vergence are both driven in the proper direction and faster than
normal when a saccade from close to distant target (or the reverse) is made [Mil85].

Work with the Rochester robot has concentrated on "gross vergence", mediated through disparity computed
between full-field images with variants of the cepstral filter [OP89]. The simulator described here is driven
by horizontal disparity between the left and right target images. In the simulator (which does not include
focus), the cooperation of vergence and saccades is achieved simply, by the device of letting imaging, disparity
calculation, and vergence reflex run during saccades. This method may or may not be nonbiological (as usual
there is some dispute about the amount of visual processing that goes on during saccades). Its practical
disadvantage is that it is inefficient: it is just as easy to have the saccade control both eyes. The only reason
the current simulator does not run this way is that it is less interesting.

The saccadic system has a longer delay than smooth pursuit (120 ms as opposed to 50 ms), reflecting its
higher-level control origins. It can move the eye at 300 to 400 degrees/second. It is often modeled as a
sampled-data system, kept stable by a latency and trigger mechanism that inhibits its firing again before the
system has settled. In our robot system, saccades should not be needed for position control during tracking,
and thus will be associated with shifts of attention, or at least of visual resource commitment.

In the experiments shown, the maximum saccade speed was limited but the maximum speeds for other
reflexes were not (compare the .1 rad/tick saccade rate in Fig. 3(a) with the .3 rad/tick speed of the
tracking and vergence in Fig. 2(d)). Clearly the control should not be allowed to command unrealistic




speeds, and the relative strengths of the outputs must be adjusted. In our simulation, the strictly "left eye
dominant" implementation of saccades and of tracking is almost certainly an exaggeration of the ocular
dominance effects in primates. Still, from a practical point of view it means that the necessary low-level
vision computations do not need to be carried out in both eyes simultaneously.

The Vestibulo-Ocular Reflex

The Vestibulo-Ocular Reflex (VOR) stabilizes gaze by counteracting commanded head movements with eye
movements. It is the fastest visual reflex, with a delay of only approximately 16 milliseconds. It is an
open-loop control, in the sense that vestibular sensor output is converted to eye muscle input and delivered
through a path of approximately three synapses. It can be a high gain control (gain approximately 1): it
can often exactly cancel out head motion effects. The VOR being open loop, there is a general problem of
how it internally models the system it is controlling.

Research on the VOR has addressed the geometrical aspect of its modelling: the conversion of sensor
signals in the coordinate systems of the semicircular canals to effector signals for the variously-placed eye
muscles. Robinson [Rob85] models the geometrical transformations as 3x3 matrices operating on 3-vectors.
Changing matrix components can accomplish adaptation, and the adaptation can be driven by stimuli
such as retinal slip (indicating a failure of the reflex) without explicitly modelling the sensorimotor system.
Pellionisz [Pel85,PP88] uses tensors to model the differing transformation properties of the sensory and motor
vectors and transformations, and addresses the problem of underdetermined control of the many muscles
that accomplish eye and head movements by the relatively small number of sensor dimensions.

The VOR's input originates in the linear and angular accelerometers of the otolith organs and semicircular
canals. They have very short time constants, but the VOR operates correctly for slow velocities. This leads
to the postulation of a "velocity storage mechanism" that integrates the output of the accelerometers and
makes the resulting velocity signal available for control (e.g. [RC85]).
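
As a rough illustration of velocity storage feeding an open-loop reflex, the sketch below integrates a sampled acceleration signal and issues an opposing eye-velocity command. It assumes an idealized, non-leaky integrator and a scalar (one-axis) signal; the functions `velocity_storage` and `vor_command` and the sample values are hypothetical.

```python
def velocity_storage(accel, dt):
    """Integrate sampled head acceleration into a head-velocity estimate."""
    velocity, estimates = 0.0, []
    for a in accel:
        velocity += a * dt
        estimates.append(velocity)
    return estimates


def vor_command(head_velocity, gain=1.0):
    """Open-loop VOR: command eye velocity opposing the head velocity."""
    return [-gain * v for v in head_velocity]


# The head accelerates for four samples, then coasts at constant velocity.
accel = [1.0] * 4 + [0.0] * 4
head_vel = velocity_storage(accel, dt=0.5)
eye_vel = vor_command(head_vel)
print(head_vel)
print(eye_vel)
```

Once the acceleration returns to zero the accelerometer output carries no information, but the stored velocity still supports a compensating eye movement during the constant-velocity phase.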

Other VOR work addresses its time-dependent behavior: its gain and phase-lag characteristics under different
conditions (e.g. several papers in [BJ85]). Much of the VOR's behavior can be explained as parameter
variation among its gain, bias, and time constants. Miles et al. [MOL85] develop a multi-channel model
to explain the VOR's ability to cope with the frequency-dependent output characteristics of the sensors, with
frequency-selective adaptation properties of the VOR itself, and with other adaptive properties of the VOR.
This work presents explicit transfer functions for the semicircular canals, the oculomotor plant, the velocity
storage mechanism, and the neural channels that convert head velocity estimates to motor outputs. The
channel model is linear and can be stated as a lumped-parameter linear system, but the channels make it
easier to identify which gains must be changed to reduce system errors.

A basic aspect of the VOR is its adaptability. The reflex adapts over time to changes in the optical system
(e.g. artificially induced dysmetria) [Rob85]. The VOR interacts with other reflexes and the stimuli that
evoke them. For example, large-field rotations that elicit the OKR have an interesting effect. If they are
slow, they bias the VOR (and the opto-kinetic system) in the same direction, which tends to cancel the
movement effect. If they are fast, they induce effects in the opposite direction, which may be interpreted as
ignoring the movement effect [Col85]. VOR gain can be depressed from 1.0 to 0.1 by training that involves
no visual input (subject imagines tracking a target attached to head while moving head in the dark), and is
likewise significantly affected by verbal instructions and other seemingly unrelated activities (such as mental
arithmetic) [JB85].

Adaptation and modeling can come together in VOR behavior that adapts to repetitive patterns (a perhaps
familiar example is disembarking from a longish sailing journey). One way to achieve this capability is
through a "pattern storage" mechanism that effectively produces and uses a model of the outside world.
Some workers are attracted to this idea; others seem to think it is unnecessary and that the behavior is
explicable by, for instance, channel adaptation.

What has all this to do with a robotic VOR? Many of the issues mentioned above can be made to vanish.




We may know the relation of the sensor output to the desired motor output if we decide to model the robot
and head kinematics accurately. (In fact in the simulation, the robotic VOR makes several approximations,
including a "spherical" geometry for the camera rotation axes, a small-angle approximation, and others.) We
can sense velocities directly or even actively monitor the relevant control signals we need to cancel. The
fundamental issues that still need significant work involve adaptation and interaction. Adequate understanding
of these issues would not only give the robot system the efficiency exhibited by natural systems, but could
mean that such exercises as accurate kinematic modeling would become unnecessary.

Head  Control 

There is less written on head control than on gaze control, but a good recent collection of work exists [PR88].
There are various head stabilization reflexes, some tied to optical stimulation. The relation of head control
strategies to the evolution of particular brain mechanisms and the existence of foveate vision is explored
by Roucoux and Crommelinck [RC88]. Some fairly detailed biomechanical head models exist, and head
movements have been investigated from the point of view of optimal control theory. Head movements can
be quite rapid (600-700 degrees/second) and are part of normal long-distance saccades in primates. Thus
the saccadic and head-control systems work together to achieve gaze redirection. There has been some work
here (e.g. [Gui88]) indicating that head movements can take place at differing times relative to saccades.
Typically, they lead or lag depending on whether the target location is predictable or not.

This coupling of head and eye movements is clearly more sophisticated than the compensatory reflex
implemented in the simulation, which is not coupled to saccades at all and which must lag eye movements since it
is only driven by eye positions. Thus more work needs to be done if we are to achieve the increased rapidity
of gaze redirection that arises when both head and eyes are moved in a coordinated way.

Another  Model  of  Delay  Control 

The control scheme implemented in this simulation, the Smith predictor, differs from a scheme seemingly
first proposed in a gaze-control context by Young, taken a step further by Robinson, and used recently in
robotic gaze-control for an agile, two-eyed robotic head at Harvard University [CF88].

Young [You77] wanted to explain how smooth pursuit avoided instability in the presence of two difficulties
that apply if tracking is modeled as a pure negative feedback system. First, the error, and thus control,
signal is zero when accurate tracking is achieved; this should send eye velocity transiently to zero. Second,
tracking performance is better than it should be given the delays in the control loop and the time constants
of the processes. His proposal is that the system tracks not the retinal image, but a neural signal that
corresponds to target motion (in the world).

In 1971 (for a recent reference, applied to saccadic, tracking, and limb control, see [Rob88]) Robinson
proposed a mechanism to implement Young's idea. In the negative feedback system the eye velocity is fed
back and subtracted from the target velocity (with some delay). If the eye is in the process of tracking, then
the target velocity is the sum of the eye velocity (with respect to the head) and the target's retinal velocity
(its velocity with respect to the eye). But the latter is just the error signal resulting from negative feedback.
Thus an estimated target velocity signal can be constructed by positively feeding back the commanded eye
motion into the control loop, delayed to arrive at the proper time to combine with the error term produced
by negative feedback. This mechanism not only provides a signal based on the target's true motion, but it
cancels the negative feedback and thus removes the possibility of oscillations.
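
The effect of the delayed positive feedback can be seen in a toy discrete-time loop. The sketch assumes an ideal plant (eye velocity equals the command), unit gain, and a pure delay of a fixed number of ticks in the visual path; the function `pursue` and its parameters are illustrative and not drawn from [Rob88].

```python
def pursue(target_vel, delay, positive_feedback):
    """Eye velocity over time for a delayed velocity-error pursuit loop."""
    eye_vel = []
    for k in range(len(target_vel)):
        if k < delay:
            eye_vel.append(0.0)          # no visual signal has arrived yet
            continue
        # Retinal slip seen now reflects target and eye `delay` ticks ago.
        slip = target_vel[k - delay] - eye_vel[k - delay]
        # Efference copy of the command, delayed to line up with the slip.
        efference = eye_vel[k - delay] if positive_feedback else 0.0
        eye_vel.append(slip + efference)  # reconstructed target velocity
    return eye_vel


target = [1.0] * 10                       # target moving at constant velocity
with_copy = pursue(target, delay=2, positive_feedback=True)
error_only = pursue(target, delay=2, positive_feedback=False)
print(with_copy)
print(error_only)
```

With the delayed efference copy the command equals the delayed target velocity and stays there. Driven by the error alone, the eye velocity falls back to zero each time tracking becomes accurate, which is the first difficulty Young pointed out.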

Robinson's scheme is related to the Smith controller shown in Figure 1 in the following way. In Figure 1,
the signal at E is an error signal, and the one at D is a difference of error signals that is zero when perfect
tracking is taking place. This difference of errors is a delayed (but consistent) error signal that is added to
the predicted error signal in the non-delayed path C. The controller in Figure 1 tries to drive errors to zero.
To change Figure 1 to Robinson's scheme, delete path C and remove the modelled world and sensor from
the lower half of the block diagram. Then path B carries the simulated plant, not the simulated error. Path
E still contains error, but path D now contains a prediction, or reconstruction, of the world state. Thus




the controller now must treat the signal at D as a set point to be achieved through open-loop methods, not
as an error. Robinson proposes parameter adaptive control (in the form of two related gains) to provide
adaptive capability should the open loop yield the wrong results.

There are thus some similarities between the two schemes, but the underlying control philosophies are rather
different. In particular, losing the power of negative feedback is a large sacrifice that the roboticist may
not need to make. The Smith predictor control system keeps the advantage of feedback control (running
on the modelled world and plant). There are many methods of estimation, observation, and prediction of
world, sensor, and plant used in modern control theory, and thus the Smith model allows for flexibility in
the assumptions underlying its predictions.
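
For readers unfamiliar with the technique, the following sketch shows a textbook discrete-time Smith predictor on a toy plant. It is not the wiring of Figure 1: the plant is assumed to be a pure integrator with a command delay, the internal model is assumed perfect, and the function `simulate` and its parameters are illustrative only.

```python
def simulate(delay, gain, setpoint, ticks, smith):
    """Proportional control of y[k+1] = y[k] + u[k - delay]."""
    u = []        # issued commands
    y = [0.0]     # real plant output (commands take effect after `delay`)
    ym = [0.0]    # internal model without delay: ym[k+1] = ym[k] + u[k]
    for k in range(ticks):
        if smith:
            ym_delayed = ym[k - delay] if k >= delay else 0.0
            # Feed back the undelayed model output, corrected by the
            # mismatch between the real plant and the delayed model.
            feedback = ym[k] + (y[k] - ym_delayed)
        else:
            feedback = y[k]
        u.append(gain * (setpoint - feedback))
        arrived = u[k - delay] if k >= delay else 0.0
        y.append(y[k] + arrived)
        ym.append(ym[k] + u[k])
    return y


smith_trace = simulate(delay=3, gain=1.0, setpoint=1.0, ticks=20, smith=True)
naive_trace = simulate(delay=3, gain=1.0, setpoint=1.0, ticks=20, smith=False)
print(smith_trace[:8])
print(naive_trace[:16])
```

With the predictor, the controller in effect closes its loop around the undelayed model, so the real output reaches the set point one delay later and stays there. The same gain applied directly to the delayed measurement overshoots and oscillates with growing amplitude.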

FUTURE  WORK 

We  plan  to  supply  more  quantitative  model  parameters,  and  to  try  to  model  the  spatial  and  temporal  scales 
that  actually  apply  in  the  laboratory.  Sensitivity  analysis  will  be  undertaken  to  quantify  the  effects  of  various 
disturbances,  especially  the  problem  of  unpredictable  delays. 

We plan to integrate some of the existing Kalman filtering tracking utilities [Bro89,BF88] to perform
estimation of the target's state. Also we may explore estimation techniques [Gel73,Ber76,Eyk74] instead of
simulation techniques to predict the state of the plant.

The simulated system can support other relevant aspects of the control problem, including the important
one of adapting to changes in the plant. In other work, we have implemented "the MIT rule", which is
a gradient descent method similar to back-propagation learning in neural nets, to learn part of the robot
head geometry. In a way this learning system acts like another control system, whose inputs are the discrepancies
between expected and observed target motions given eye motions, and whose outputs are parameters to the modeled
plant (in this case, lengths of links in the head kinematic chain).
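
A small sketch may make the gradient-descent character of the MIT rule concrete. It assumes a deliberately simplified, hypothetical model in which observed target motion is a single unknown length times the eye motion; the function `mit_rule`, the learning rate, and the data are illustrative and are not the head-geometry model used in the work mentioned above.

```python
def mit_rule(theta, samples, rate):
    """Adapt one model parameter by the MIT rule (discrete-time form).

    Each sample is (eye_motion, observed_target_motion). The model predicts
    observed = theta * eye_motion; the update is theta -= rate * e * de/dtheta.
    """
    for eye_motion, observed in samples:
        error = theta * eye_motion - observed   # expected minus observed
        theta -= rate * error * eye_motion      # de/dtheta = eye_motion
    return theta


true_length = 0.3                                # hypothetical link length
eye_motions = [0.5, -1.0, 1.5] * 70              # repeated probe motions
samples = [(x, true_length * x) for x in eye_motions]

estimate = mit_rule(theta=1.0, samples=samples, rate=0.2)
print(estimate)
```

In this noise-free linear case each update shrinks the parameter error by a factor of 1 - rate * eye_motion**2, so the estimate converges provided the rate is small enough relative to the size of the eye motions.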

Implementation of an increasingly sophisticated gaze control system on the Rochester robot should take
place over the next few years. We anticipate substituting a Butterfly Parallel Processor with multiple input
and output ports for the central controller of the system.


Acknowledgements 

This work is funded by the DARPA U.S. Army Engineering Topographic Laboratories Grant
DACA76-85-C-0001 and the Air Force Systems Command (RADC, Griffiss AFB, NY) and Air Force OSR Contract
F30602-83-C-0008, which supports the Northeast Artificial Intelligence Consortium. The assistance of Prof.
J. Michael Brady, Dr. Hugh Durrant-Whyte, Dr. Ron Daniels, and the RRG research and administrative
staff is gratefully acknowledged.


References 

[Ber76] Dimitri P. Bertsekas. Dynamic Programming and Stochastic Control. Academic Press, 1976.

[BF88] Y. Bar-Shalom and T. E. Fortmann. Tracking and Data Association. Academic Press, 1988.

[BJ85] A. Berthoz and G. Melvill Jones. Adaptive Mechanisms in Gaze Control: Facts and Theories.
Elsevier, 1985.

[BO88] D. H. Ballard and A. Ozcandarli. Real-time kinetic depth. In Second Int. Conf. on Computer
Vision, November 1988.




[BR88] C. M. Brown and R. Rimey. Kinematics, Coordinate Systems and Conversions for the Rochester
Robot. Technical Report 255, University of Rochester, September 1988.

[Bro88] C. M. Brown. The Rochester Robot. Technical Report 257, University of Rochester, September
1988.

[Bro89] C. M. Brown. Kalman filtering for tracking and control. In DARPA Image Understanding
Workshop, June 1989.

[CF88] J. J. Clark and N. J. Ferrier. Modal control of an attentive vision system. In Second Int. Joint
Conference on Computer Vision, November 1988.

[Col85] H. Collewijn. Integration of adaptive changes of the optokinetic reflex, pursuit and the
vestibulo-ocular reflex. In A. Berthoz and G. Melvill Jones, editors, Adaptive Mechanisms in Gaze Control.
Elsevier, 1985.

[Eck83] R. Eckmiller. Neural control of foveal pursuit versus saccadic eye movements in primates -
single-unit data and models. IEEE Trans. on Syst., Man, and Cyber., SMC-13(5):980-989, Sept./Oct.
1983.

[Eyk74] Pieter Eykhoff. System Identification: Parameter and State Estimation. Wiley and Sons, 1974.

[Gel73] Arthur C. Gelb. Applied Optimal Estimation. The MIT Press, 1973.

[Gui88] D. Guitton. Eye-head coordination in gaze control. In B. W. Peterson and F. J. Richmond, editors,
Control of Head Movement. Oxford University Press, 1988.

[JB85] G. Melvill Jones and A. Berthoz. Mental control of the adaptive process. In A. Berthoz and
G. Melvill Jones, editors, Adaptive Mechanisms in Gaze Control. Elsevier, 1985.

[LMT85] S. G. Lisberger, E. J. Morris, and L. Tychsen. Visual motion processing and sensory-motor
integration for smooth pursuit eye movements. In A. Berthoz and G. Melvill Jones, editors,
Adaptive Mechanisms in Gaze Control. Elsevier, 1985.

[Mar79] J. E. Marshall. Control of Time-Delay Systems. Peter Peregrinus Ltd., 1979.

[Mil85] F. A. Miles. Adaptive regulation in the vergence and accommodation control systems. In A.
Berthoz and G. Melvill Jones, editors, Adaptive Mechanisms in Gaze Control. Elsevier, 1985.

[MOL85] F. A. Miles, L. M. Optican, and S. G. Lisberger. An adaptive equalizer model of the primate
vestibulo-ocular reflex. In A. Berthoz and G. Melvill Jones, editors, Adaptive Mechanisms in Gaze
Control. Elsevier, 1985.

[OP89] T. Olson and R. Potter. Real-time vergence control. In Computer Vision and Pattern Recognition
1989, June 1989.

[Pel85] A. J. Pellionisz. Tensorial aspects of the multidimensional approach to the vestibulo-oculomotor
reflex and gaze. In A. Berthoz and G. Melvill Jones, editors, Adaptive Mechanisms in Gaze Control.
Elsevier, 1985.

[PP88] A. J. Pellionisz and B. W. Peterson. A tensorial model of neck motor activation. In B. W. Peterson
and F. J. Richmond, editors, Control of Head Movement. Oxford University Press, 1988.

[PR88] B. W. Peterson and F. J. Richmond. Control of Head Movement. Oxford University Press, 1988.

[RC85] T. Raphan and B. Cohen. Velocity storage and the ocular response to multidimensional vestibular
stimuli. In A. Berthoz and G. Melvill Jones, editors, Adaptive Mechanisms in Gaze Control.
Elsevier, 1985.




[RC88] A. Roucoux and M. Crommelinck. Control of head movement during visual orientation. In B. W.
Peterson and F. J. Richmond, editors, Control of Head Movement. Oxford University Press, 1988.

[Rob85] D. A. Robinson. The coordinates of neurons in the vestibulo-ocular reflex. In A. Berthoz and
G. Melvill Jones, editors, Adaptive Mechanisms in Gaze Control. Elsevier, 1985.

[Rob88] D. A. Robinson. Why visuomotor systems don't like negative feedback and how they avoid it. In
M. A. Arbib and A. R. Hanson, editors, Vision, Brain, and Cooperative Computation. MIT Press,
1988.

[Smi57] O. J. M. Smith. Closer control of loops with dead time. Chemical Engg. Prog. Trans., 53(5):217-
219, 1957.

[Smi58] O. J. M. Smith. Feedback Control Systems. McGraw-Hill, 1958.

[You77] L. R. Young. Pursuit eye movement - what is being pursued? Dev. Neurosci.: Control of Gaze by
Brain Stem Neurons, 1:29-36, 1977.




